Consistency in a Stream Warehouse

نویسندگان

  • Lukasz Golab
  • Theodore Johnson
چکیده

A stream warehouse is a Data Stream Management System (DSMS) that stores a very long history, e.g. years or decades; or equivalently a data warehouse that is continuously loaded. A stream warehouse enables queries that seamlessly range from realtime alerting and diagnostics to long-term data mining. However, continuously loading data from uncontrolled sources into a realtime stream warehouse introduces a new consistency problem: users want results in as timely a fashion as possible, but “stable” results often require lengthy synchronization delays. In this paper we develop a theory of consistency for stream warehouses that allows for multiple consistency levels, we show how to restrict query answers to a given consistency level, and we show how warehouse maintenance can be optimized using knowledge of the consistency levels required by materialized views.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Strobe Algorithms for Multi-Source Warehouse Consistency

A warehouse is a data repository containing integrated information for e cient querying and analysis. Maintaining the consistency of warehouse data is challenging, especially if the data sources are autonomous and views of the data at the warehouse span multiple sources. Transactions containing multiple updates at one or more sources, e.g., batch updates, complicate the consistency problem. In ...

متن کامل

Consistency Algorithms for Multi - SourceWarehouse

A warehouse is a data repository containing integrated information for eecient querying and analysis. Maintaining the consistency of warehouse data is challenging, especially if the data sources are autonomous and views of the data at the warehouse span multiple sources. Transactions containing multiple updates at one or more sources, e.g., batch updates, complicate the consistency problem. In ...

متن کامل

Multiple View Consistency for Data Warehousing

A data warehouse stores integrated information from multiple distributed data sources. In effect, the warehouse stores materialized views over the source data. The problem of ensuring data consistency at the warehouse can be divided into two components: ensuring that each view reflects a consistent state of the base data, and ensuring that multiple views are mutually consistent. In this paper w...

متن کامل

بهبود به‌روزرسانی پایگاه داده تحلیلی نیمه‌آنی

Near-real time data warehouse gives the end users the essential information to achieve appropriate decisions. Whatever the data are fresher in it, the decision would have a better result either. To achieve a fresh and up-to-date data, the changes happened in the side of source must be added to the data warehouse with little delay. For this reason, they should be transformed in to the data wareh...

متن کامل

Towards the Stream Analysis Model in Grid-based Zero-Latency Data Stream Warehouse

Recent emerging applications increasingly generate continuous, larger amounts of valuable data. The demand of conducting advanced analysis over fast and huge data streams to capture trends, patterns, and exceptions become crucial. However, fully extracting the latent knowledge within the data stream is a challenging task because of insufficient technology. While Data Warehouse (DWH) technologie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011